Large Scale Explorative Oligonucleotide Probe Selection for Thousands of Genetic Groups on a Computing Grid: Application to Phylogenetic Probe Design Using a Curated Small Subunit Ribosomal RNA Gene Database
نویسندگان
چکیده
Phylogenetic Oligonucleotide Arrays (POAs) were recently adapted for studying the huge microbial communities in a flexible and easy-to-use way. POA coupled with the use of explorative probes to detect the unknown part is now one of the most powerful approaches for a better understanding of microbial community functioning. However, the selection of probes remains a very difficult task. The rapid growth of environmental databases has led to an exponential increase of data to be managed for an efficient design. Consequently, the use of high performance computing facilities is mandatory. In this paper, we present an efficient parallelization method to select known and explorative oligonucleotide probes at large scale using computing grids. We implemented a software that generates and monitors thousands of jobs over the European Computing Grid Infrastructure (EGI). We also developed a new algorithm for the construction of a high-quality curated phylogenetic database to avoid erroneous design due to bad sequence affiliation. We present here the performance and statistics of our method on real biological datasets based on a phylogenetic prokaryotic database at the genus level and a complete design of about 20,000 probes for 2,069 genera of prokaryotes.
منابع مشابه
The SILVA ribosomal RNA gene database project: improved data processing and web-based tools
SILVA (from Latin silva, forest, http://www.arb-silva.de) is a comprehensive web resource for up to date, quality-controlled databases of aligned ribosomal RNA (rRNA) gene sequences from the Bacteria, Archaea and Eukaryota domains and supplementary online services. The referred database release 111 (July 2012) contains 3 194 778 small subunit and 288 717 large subunit rRNA gene sequences. Since...
متن کاملprobeBase: an online resource for rRNA-targeted oligonucleotide probes
Ribosomal RNA-(rRNA)-targeted oligonucleotide probes are widely used for culture-independent identification of microorganisms in environmental and clinical samples. ProbeBase is a comprehensive database containing more than 700 published rRNA-targeted oligonucleotide probe sequences (status August 2002) with supporting bibliographic and biological annotation that can be accessed through the int...
متن کاملPhylOPDb: a 16S rRNA oligonucleotide probe database for prokaryotic identification
In recent years, high-throughput molecular tools have led to an exponential growth of available 16S rRNA gene sequences. Incorporating such data, molecular tools based on target-probe hybridization were developed to monitor microbial communities within complex environments. Unfortunately, only a few 16S rRNA gene-targeted probe collections were described. Here, we present PhylOPDb, an online re...
متن کاملDevelopment and application of the human intestinal tract chip, a phylogenetic microarray: analysis of universally conserved phylotypes in the abundant microbiota of young and elderly adults
In this paper we present the in silico assessment of the diversity of variable regions of the small subunit ribosomal RNA (SSU rRNA) gene based on an ecosystem-specific curated database, describe a probe design procedure based on two hypervariable regions with minimal redundancy and test the potential of such probe design strategy for the design of a flexible microarray platform. This resulted ...
متن کاملThe oligonucleotide probe database.
The use of oligonucleotide hybridization probes and PCR primers has become widespread in microbial ecology and environmental microbiology (for reviews, see references 3, 5, 7, 17, and 21), and descriptions of probe applications are abundant in the literature. We have encountered, however, a number of difficulties when relying on the literature for information on probes and primers: (i) probe de...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 2014 شماره
صفحات -
تاریخ انتشار 2014